Initialization of Optimized K-means Centroids Using Divide-and-conquer Method
نویسندگان
چکیده
K-means clustering algorithm is one of the most popular unsupervised learning algorithm that is broadly used to clustering the given data items. The k-means algorithm is one of the commonly used clustering methods in data mining. A number of algorithms have been developed for clustering the data items using K-Means due to its simplicity and efficiency. The final clustering result of the K-Means clustering algorithm highly depends upon the initial centroids, which are selected at random by the user. The difficulty of determining “the right number of clusters” in traditional K-Means clustering has attracted significant importance especially in the recent years. There are many improvement were already developed to get better performance of the k-means, but most of these methods needed other inputs like threshold values for the number of data points in a data set. In this work, the proposed algorithm can solve the problems of finding initial centroids and assigning data items to proper clusters using divide-and-conquer method. So in proposed method, the initial cluster centers have obtained using divide-and-conquer property after that K-Means algorithm is applied to gain optimal cluster centers in dataset. The proposed algorithm can improve the execution speed of clustering the data items using little number of iterations. With the help of mathematical calculations the proposed algorithm decreases the complexity which we face in k-means clustering algorithm.
منابع مشابه
Free Vibration Analysis of Repetitive Structures using Decomposition, and Divide-Conquer Methods
This paper consists of three sections. In the first section an efficient method is used for decomposition of the canonical matrices associated with repetitive structures. to this end, cylindrical coordinate system, as well as a special numbering scheme were employed. In the second section, divide and conquer method have been used for eigensolution of these structures, where the matrices are in ...
متن کاملA new Initial Centroid finding Method based on Dissimilarity Tree for K-means Algorithm
—Cluster analysis is one of the primary data analysis technique in data mining and K-means is one of the commonly used partitioning clustering algorithm. In K-means algorithm, resulting set of clusters depend on the choice of initial centroids. If we can find initial centroids which are coherent with the arrangement of data, the better set of clusters can be obtained. This paper proposes a meth...
متن کاملEfficient and Fast Initialization Algorithm for K- means Clustering
The famous K-means clustering algorithm is sensitive to the selection of the initial centroids and may converge to a local minimum of the criterion function value. A new algorithm for initialization of the K-means clustering algorithm is presented. The proposed initial starting centroids procedure allows the K-means algorithm to converge to a “better” local minimum. Our algorithm shows that ref...
متن کاملImproved COA with Chaotic Initialization and Intelligent Migration for Data Clustering
A well-known clustering algorithm is K-means. This algorithm, besides advantages such as high speed and ease of employment, suffers from the problem of local optima. In order to overcome this problem, a lot of studies have been done in clustering. This paper presents a hybrid Extended Cuckoo Optimization Algorithm (ECOA) and K-means (K), which is called ECOA-K. The COA algorithm has advantages ...
متن کاملBiBinConvmean : A Novel Biclustering Algorithm for Binary Microarray Data
In this paper, we present a new algorithm called, BiBinConvmean, for biclustering of binary microarray data. It is a novel alternative to extract biclusters from sparse binary datasets. Our algorithm is based on Iterative Row and Column Clustering Combination (IRCCC) and Divide and Conquer (DC) approaches, K-means initialization and the CroBin evaluation function [6]. Applied on binary syntheti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016